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ABSTRACT 

The H1 linker histones are abundant chromatin- 
associated DNA-binding proteins. Recent evidence 
suggests that linker histones also may function 
through protein-protein interactions. To gain a 
better understanding of the scope of linker histone 
involvement in protein-protein interactions, we used 
a proteomics approach to identify H1 -binding 
proteins in human nuclear extracts. Full-length 
H1.0 and H1.0 lacking its C-terminal domain (CTD) 
were used for protein pull-downs. A total of 107 can- 
didate H1.0 binding proteins were identified by 
LC-MS/MS. About one-third of the M.O-dependent 
interactions were mediated by the CTD, and two- 
thirds by the N-terminal domain-globular domain 
fragment. Many of the proteins pulled down by 
H1.0 were core splicing factors. Another group of 
H1 -binding proteins functions in rRNA biogenesis. 
H1.0 also pulled down numerous ribosomal 
proteins and proteins involved in cellular transport. 
Strikingly, nearly all of the H1.0-binding proteins are 
found in the nucleolus. Quantitative biophysical 
studies with recombinant proteins confirmed 
that H1.0 directly binds to FACT and the splicing 
factors SF2/ASF and U2AF65. Our results demon- 
strate that H1.0 interacts with an extensive 
network of proteins that function in RNA metabol- 
ism in the nucleolus, and suggest that a new 
paradigm for linker histone action is in order. 

INTRODUCTION 

The chromatin of most mammalian tissues consists of 
arrays of nucleosomes spaced at ~160-210-bp intervals, 



together with 0.6-1.0 HI linker histones per nucleosome 
(1). Six major linker histone isoforms are expressed 
in human somatic cells (2). The linker histones of higher 
eukaryotes have a short (13-40 residue) unstructured 
N-terminal domain (NTD), a central well-ordered 
globular domain (GD) and a ~ 100 residue unstructured 
C-terminal domain (CTD). Stabilization of the condensed 
states of chromatin is the most common function 
attributed to linker histones (3,4). The role of the NTD 
in chromatin condensation is unknown. The GD is a 
winged helix DNA-binding domain (5,6) that mediates 
the interaction of linker histones with nucleosomes (7). 
The intrinsically disordered CTD (8) interacts with the 
linker DNA that connects adjacent nucleosomes in a chro- 
matin fiber (9) and is required for the chromatin 
condensing functions of linker histones (10,11). 

In addition to their role in regulating chromatin archi- 
tecture via protein-DNA interactions, linker histones 
interact with non-histone proteins. A recent literature 
survey uncovered reports scattered during 20-year period 
for 16 linker histone-binding proteins (4). In most of these 
cases, the interactions were identified using co-IP and/ 
or pull-down assays for the specific linker histone- 
protein interaction in question. Two recent studies 
used a more systematic approach to probe for linker 
histone-protein interactions. In one case (12), anti-Hl 
antibodies were used to co-IP Hl-protein complexes 
from Drosophila Kc cells. A number of ribosomal 
proteins were identified that, together with HI, were 
involved in specific gene repression. In the second study 
(13), a cell line expressing a tandem tagged version of the 
HI. 2 isoform was used together with chromatography 
and glycerol gradient ultracentrifugation to isolate a 
stable complex containing HI. 2, specific ribosomal 
proteins, transcriptional repressors (e.g. YB1, PARP1 
and PURa) and transcriptional activators (e.g. WDR5, 
CAPERa and nucleolin). 
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The scope of linker histone participation in protein- 
protein interactions currently is unknown, i.e. are Hi- 
protein interactions relatively isolated occurrences, or 
are they more common than currently believed? 
Moreover, the extent to which each of the linker histone 
domains mediates protein-protein interactions is unclear. 
This is an intriguing question given that the GD and the 
CTD both are well-established DNA-binding domains. 
Here, we have used the HaloTag methodology (14) 
together with mass spectrometry to identify candidate 
HI -binding proteins in nuclear extracts prepared from 
human cell lines. Both full-length H1.0 and H1.0 lacking 
its CTD were used as the ligands in these experiments, 
allowing determination of interactions that were depend- 
ent on the CTD and NTD-GD fragment. We have 
identified 107 proteins that were pulled down by H1.0 
under the conditions of our experiments. The CTD and 
the NTD-GD fragment each were found to mediate 
multiple protein-protein interactions. Thirty-three of the 
proteins pulled down by H1.0 were splicing factors. H1.0 
also pulled down numerous proteins involved in rRNA 
biogenesis, ribosome function/translation and cellular 
transport. Ninety-four of the proteins identified in our 
studies are found in the nucleolus. The interactions 
between H1.0 and U2AF65, SF2/ASF and FACT were 
validated by quantitative binding experiments with pure 
recombinant proteins. Our results indicate that histone 
H1.0 is a central component of a large network of 
protein-protein interactions involved in RNA metabolism 
in the nucleolus. 



MATERIALS AND METHODS 

Protein expression and purification 

To prepare the HaloTag fusion proteins, cDNAs encoding 
full-length mouse H1.0 (residues 1-194) and H1ACTD 
(residues 1-97) were cloned into the pFN18K T7 Flexi 
vector (HaloTag 7, Promega) via the Pmel and Sgfl re- 
striction sites. The HaloTag-only control expression 
vector was constructed following manufacturer's instruc- 
tions, using annealed oligonucleotides 5'-CGCGTAAGG 
GTAGGTTT and 5-AAACCTACCCTTACGCGAT to 
ligate the digested pFN18 plasmid. All constructs were 
confirmed by DNA sequencing. pFN18-Halo-H1.0, 
pFN18-Halo-HlACTD and pFN18-Halo were trans- 
formed into Escherichia coli BL21(DE3) pLysS-competent 
cells. The transformed cells were grown at 37°C in LB 
medium to an OD 600 of 0.6. After induction with 
0.4 mM isopropyl-l-thio-D-galactopyranoside, cells were 
grown for 30min, rifampicin was added to a concentra- 
tion of 50ug/ml and cells were grown for >2h. The cells 
were then harvested and washed with 10 mM Tris, pH 8.0, 
and 100 mM NaCl. Cell pellets were sonicated in three 
volumes of Halo buffer [25 mM Tris, pH 8.0, 0.5 mM 
ethylenediaminetetraacetic acid (EDTA), lOOmM NaCl, 
0.5 mM phenylmethylsulfonyl fluoride, 1.0 mM benz- 
amidine and protease inhibitor cocktails II and III 
(Calbiochem)], and the HaloTag proteins were purified 
on HiTrap SP-FF followed by MonoQ (for HaloTag 
alone) (GE Healthcare). The proteins were dialyzed 



overnight against Halo buffer and stored at 4°C. 
Concentration was defined using a BCA protein assay 
kit (Thermo scientific). 

pET30a-U2AF65 and pET24b-SF2/ASF (encoding for 
residues 11-196 of SF2/ASF) were transformed into 
E. coli BL21(DE3) pLysS-competent cells and expressed 
as described earlier in the text. Cell pellets were isolated as 
described previously. Cell pellets were resuspended in 
50 mM potassium phosphate buffer (pH 7.4), 900 mM 
NaCl, lOmM imidazole and loaded onto a 5-ml 
HisTrap HP column (GE Healthcare). Proteins were 
eluted from the HisTrap column with a 1 0-500 mM 
linear imidazole gradient. Peak fractions were loaded on 
a 15% sodium dodecyl sulfate (SDS) polyacrylamide gel 
to analyze for purity and/or degradation, then pooled and 
dialyzed into lOmM Tris, pH 7.5, 300mM NaCl and 
0.1 mM PMSF. FACT was expressed purified as described 
in the study conducted by Winkler et al. (15). 

Preparation of nuclear extracts 

Nuclear extracts were prepared using modified Dignam 
protocol (16). Four different human cell lines were used 
as follows: CEM, HeLa, U20S and RPE-1. Cells were 
grown, harvested, spun and washed in phosphate- 
buffered saline (137mM NaCl, lOmM phosphate, 
2.7 mM KC1, and a pH of 7.4), resuspended in five 
packed cell volumes of buffer A (lOmM HEPES, pH 
7.9, 1.5 mM MgCl 2 and lOmM KC1), incubated on ice 
for 10 min and spun at 2000 rpm for lOmin at 4°C. The 
cell pellet was resuspended in two original packed cell 
volumes, transferred to a Dounce homogenizer and 
lysed with 20-60 strokes depending on cell type. 
Aliquots of cells were stained with DAPI (6-diamidino- 
2-phenylindole) and checked for >90% lysis using light 
microscopy. Lysed cells were spun at 2000 rpm for 
10 min to pellet nuclei, the supernatant was removed and 
pellet was re-spun at 17 000 rpm for 1 min to isolate 
remained cytoplasm. The pellet was resuspended in one 
packed nuclear volume of high-salt buffer (20 mM 
HEPES, pH 7.9, 25% v/v glycerol, 0.42 M NaCl, 
1.5 mM MgCl 2 and 0.2 mM EDTA), the mixture stirred 
gently and rotated for 30 min in a cold room. Aliquots of 
nuclei were stained with DAPI and checked for complete 
lysis. The lysed nuclei were spun at 17 000 rpm for 30 min 
at 4°C and the resulting nuclear extract dialyzed against 
buffer D (20mM HEPES, pH 7.9, 20% glycerol, 100 mM 
KC1, 0.2 mM EDTA, 0.5 mM DTT and 0.2 mM PMSF) 
for 5h. The protein concentrations of the extracts were 
determined using a BCA protein assay kit (Thermo scien- 
tific). Nuclear extracts were flash-frozen in liquid nitrogen 
and kept at -80°C. 

Pull-downs 

A slurry (300 ul) of HaloTag beads (Promega) was pre- 
equilibrated with Halo buffer and then incubated with 
0.4 mg of HaloTag-Hl.O fusion proteins or the HaloTag 
protein alone in the presence of 0.05% Igepal-40 and 
protease inhibitors [PIC Sets II and III (Calbiochem), 
0.5 mM benzamidine, lOug/ml of tosyl phenylalanyl 
chloromethyl ketone (TPCK) and 1 mM 4-(2-aminoethyl) 
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benzenesulfonyl fluoride hydrochloride (AEBSF)] with 
rocking overnight at 4°C. The beads were extensively 
washed with Halo buffer containing 250 mM NaCl and 
then equilibrated with Halo buffer containing 150 mM 
NaCl. Nuclear extract (0.5 mg of total protein) was 
added to the beads, the volume adjusted to 500 ul with 
Halo buffer, followed by incubation overnight. For the 
RNAse experiments, a portion of the nuclear extracts 
was incubated with O.lmg/ml of RNAse A (Thermo 
Scientific) and 7.5 U of exonuclease T (New England 
Biolabs) in the presence of 10 mM MgCl 2 and 1 mM 
DTT for 30min at room temperature, before incubation 
with the beads. The unbound proteins were removed by 
washing three times with Halo buffer containing 250 mM 
NaCl. Bound proteins were eluted from the beads by 
boiling in 1M NaCl and 1% SDS. The pulled down 
proteins were precipitated with 20% trichloroacetic acid, 
followed by centrifugation for 30min at 14 000rpm. The 
protein pellet was washed with 100% acetone twice and 
dissolved in Laemmli loading buffer (60 mM Tris-Cl, pH 
6.8, 2% SDS, 10% glycerol, 5% (3-mercaptoethanol and 
0.01% bromophenol blue). For LC-MS/MS analysis, 
proteins were electrophoresed on a 4-12% pre-cast poly- 
acrylamide gel (Invitrogen) and stained with Imperial Blue 
(Thermo Scientific). Proteins were run ~4mm into the 
resolving gel, and the entire stained region was excised 
and split into three fractions. Gel fractions were submitted 
to the Proteomics and Metabolomics Facility at Colorado 
State University for in-gel digestion and LC-MS/MS 
analysis. For western blots, the proteins were electro- 
phoresed and transferred to a nitrocellulose membrane 
as described later in the text. 

In-gel digestion 

Gel fractions were washed with water and destained with 
200 ml of acetonitrile (ACN) and 50 mM ammonium bicar- 
bonate (AmBIC) (1:1 v/v) at 60°C. Destained gel pieces 
were then dehydrated with 200 ul of 100% ACN with sub- 
sequent rehydration in 200 ul of 25mM DTT in 50 mM 
AmBic (incubated for 20min at 60°C). Alkylation of 
reduced cysteine was performed by the addition of 200 ul 
of 55 mM iodoacetamide in 50 mM AmBic (incubated for 
20min in dark at room temperature). Reduced and 
alkylated gel pieces were washed and dehydrated again 
with 100% ACN followed by rehydration with 20 ul of 
12ng/ml of trypsin in 0.01% ProteaseMAX™ surfactant 
(Promega Corporation). After lOmin, an additional 30 ul 
of 0.01% ProteaseMAX surfactant was added and diges- 
tion allowed proceeding for 1 h at 50°C. Digestion was 
stopped with the addition of 2.5 ul of 10% triflouroacetic 
acid. The supernatant containing the peptides was dried in a 
Speed Vac vacuum centrifuge and reconstituted in 10 ul of 
3% ACN and 0.1% formic acid. 

Mass spectrometry 

Peptides from the digested gel fractions were purified 
and concentrated using an on-line enrichment column 
(Agilent Zorbax C18, 5 um, 5 x 0.3 mm). Subsequent chro- 
matographic separation was performed on a reverse phase 
nanospray column (Agilent 1 100 nanoHPLC, Zorbax C18, 



5 um, 75 um ID x 150-mm column) using a 90min linear 
gradient from 25-55% buffer B (90% ACN and 0.1% 
formic acid) at a flow rate of 300nl/min. Peptides were 
eluted directly into the mass spectrometer (Thermo 
Scientific LTQ linear ion trap), and spectra were collected 
over a m/z range of 200-2000 Da using a dynamic exclusion 
limit of two MS/MS spectra of a given ion for 30 s (exclu- 
sion duration of 90s). Compound lists of the resulting 
spectra were generated using Bioworks 3.0 software 
(Thermo Scientific) with an intensity threshold of 5000 
and 1 scan/group. MS/MS spectra were searched against 
the Uniprot protein database (2 September 2012) using a 
taxonomy filter for 'Homo sapiens' concatenated with 
reverse sequences for determination of the peptide FDR 
(148 254 sequence entries) using the Mascot database 
search engine (version 2.3). Search parameters were as 
follows: average mass, parent ion mass tolerance of 2 Da, 
fragment ion mass tolerance of 1.5 Da, fully tryptic 
peptides with one missed cleavage, variable modification 
of oxidation of M and fixed modification of carbamido- 
methylation of C. 

Mascot search results for each independently analyzed 
gel fraction were imported and combined for each pull- 
down using probabilistic protein identification algorithms 
implemented in Scaffold software (Proteome Software, 
Portland, OR, USA). Peptide and protein probability 
thresholds of 95 and 99%, respectively, and a minimum 
of two unique peptides, were applied to the results (<0.2% 
peptide FDR as calculated by Scaffold based on matches 
to reverse hits). Proteins containing shared peptides were 
grouped by Scaffold to satisfy the laws of parsimony. 
Manual validation of MS/MS spectra was performed for 
all protein identifications above the probability thresholds 
that were based on only two unique peptides. Criteria for 
manual validation included the following: (i) minimum of 
80% coverage of theoretical y or b ions (at least five in 
consecutive order); (ii) absence of prominent unassigned 
peaks >5% of the maximum intensity; and (hi) indicative 
residue specific fragmentation, such as intense ions 
N-terminal to proline and immediately C-terminal to as- 
partate and glutamate (used as additional parameters of 
confirmation). 

Data analysis was performed separately for each experi- 
mental replicate. Proteins were determined to be candidate 
HI -binding partners if they were observed in at least one 
cell type in both experimental replicates and were not 
observed in any replicate of the HaloTag controls. 
Proteins were further determined to be CTD dependent 
if they were observed in at least one cell type, specifically 
in wild-type HI pull-downs, in both experimental repli- 
cates and were not observed in any replicate of either 
the Halo controls or in CTD mutant pull-downs. 
Common contaminant proteins, including keratins and 
trypsin (autolysis), were discarded. We have also 
excluded H1.0 because it might result from recombinant 
proteins eluted from beads during pull-downs. 

Western blotting 

Hl.O-binding proteins were pulled down from CEM nuclei 
as for the proteomics analysis. The sample was 
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electrophoresed on a 4-12% gradient SDS polyacrylamide 
gel and transferred to a nitrocellulose membrane. 
Membranes were incubated in 5% non-fat milk in a phos- 
phate-buffered saline solution containing 0.05% Tween 
(PBS-T) for 1 h and then probed with rabbit 
anti-RbAp48 antibodies (Novus, 1:2500 56483) overnight. 
After a series of washes in PBS-T, the membranes were 
re-probed with IRDye 680 goat anti-rabbit antibodies 
(1:20000; 926-32221, LI-COR) for 1 h. The proteins were 
visualized using an Odyssey (LI-COR). Western blots 
were performed at least three times. CEM nuclear 
extract was used as a positive control. 

Fluorescence quenching microplate titration assay 

Clear bottom 3 84- well microplates were prepared as pre- 
viously described (17). Binding experiments were set-up by 
diluting a high-concentration stock of the unlabeled 
titrant (U2AF65, SF2/ASF or FACT) into a series of con- 
centrations ranging from 1 nM to 10 uM. The reaction 
conditions consisted of 20 mM Tris, pH 7.5, 150mM 
NaCl, 5% glycerol, 2mM Tris(2-carboxyethyl)phosphine 
(TCEP) and 0.01% of both 3-[(3-cholamidopropyl)- 
dimethylammonio]-l-propanesulfonate (CHAPS) and 
octylglucoside detergents. The presence of low amounts 
of both ionic and non-ionic detergents minimizes 'sticking 
issues' common at low concentrations and significantly 
increased reproducibility. Histone H1.0 was conjugated 
with Oregon Green 488 via an iodoacetimide linkage at 
residue 19. Labeled H1.0 was kept at a constant concen- 
tration between 1 and 10 nM. The mixtures (40 ul) were 
allowed to equilibrate for 30min at room temperature in 
the dark before scanning at the appropriate wavelength 
using a Typhoon 8600 variable mode fluoroimager. The 
fluorescence intensities were quantified using the program 
ImageQuant TL, and data were analyzed using 
Kaleidagraph version 3.6 (Synergy software). All experi- 
ments were performed in replicative quadruplicate. 

RESULTS 

Proteomics identification of candidate Hl.O-binding 
proteins 

To catalog the HI -binding proteins present in human 
nuclear extracts, we performed pull-down experiments 
using the HaloTag system followed by mass spectrometry 
to identify the isolated proteins. The HaloTag protein 
is a mutant dehalogenase DhaA from Rhodococcus 
rhodochrous, which forms a covalent non-hydrolyzable 
intermediate on incubation with a modified hydrocarbon 
substrate (14). We first engineered chimeric HaloTag-Hl.O 
proteins by cloning the cDNAs encoding full-length 
mouse H1.0 (residues 1-194) and mouse H1.0 lacking its 
CTD (residues 1-97, designated H1ACTD) into the 
HaloTag expression vector. Residues 1-97 (corresponding 
to the NTD-GD fragment) of mouse H1.0 are 100% iden- 
tical to human H1.0, whereas residues 98-194 (corres- 
ponding to the CTD) are 94% identical. Chimeric 
proteins were purified from E. coli and incubated with a 
sepharose-bound HaloTag substrate to covalently attach 
the proteins to sepharose beads. 



Pull-downs were performed in parallel with nuclear 
extracts prepared from four different human cell lines 
(HeLa, CEM, RPE-1 and U20S). Nuclear extracts were 
incubated with sepharose-bound HaloTag-Hl.O proteins 
or the HaloTag protein alone in buffer containing 
150mM NaCl. After extensive washing with buffer con- 
taining 250 mM NaCl, bound proteins were eluted from 
the sepharose beads and run on a SDS-polyacrylamide 
gel. Excised gel slices were digested with trypsin, and the 
proteolyzed samples were analyzed by LC-MS/MS. 
Candidate Hl.O-binding proteins were identified in one of 
two ways. In our initial analysis, two pull-downs were per- 
formed for each of the four cell types. To be classified as a 
Hl.O-binding protein, at least two unique peptides had to 
be observed in both replicates for at least one cell type, but 
not in the HaloTag only samples. Seventy-three proteins 
were identified in this manner as potential Hl.O-binding 
partners (Table 1). Table 1 also includes 36 proteins that 
were identified by the RNAse analysis described later in the 
text (Supplementary Table SI). The calculated pi of human 
is H1.0 (10.9), and those of the candidate Hl-binding 
proteins ranged from 3.8 to 11.9, suggesting that the 
observed Hl.O-protein interactions do not result from 
non-specific electrostatic interactions. Most of the 
proteins pulled down by H1.0 were observed in one or 
two cell types (Table 1). Small nuclear ribonucleoprotein 
(snRNP) SmDl, splicing factor 3B subunits 1 and 3, U5 
snRNP 1 16 kDa subunit, PUF60, nucleolin, FACT subunit 
SSRP1, PABPC1, ribosomal proteins L4 and L7 and tran- 
sitional endoplasmic reticulum ATPase are notable as 
being observed in three or four cell types. 

Most of the candidate Hl-binding proteins can be 
grouped into four functional categories (Table 1). The 
first group comprises proteins involved in pre-mRNA 
splicing. Splicing is carried out by the spliceosome, 
which is composed of a host of core splicing factors and 
many other associated proteins (18,19). Thirty-three 
splicing factors were pulled down by H1.0, representing 
a significant fraction of the spliceosomal proteins and 
nearly one-third of the total Hl-binding proteins identified 
in our studies. A second major group of Hl.O-binding 
proteins functions in rRNA biogenesis, including 
nucleolin, nucleophosmin, FACT, casein kinase II, La 
and NolCl (Noppl40). The third category was related 
to translation and included 24 ribosomal proteins, eukary- 
otic translation initiation factor 3 and signal recognition 
particle subunits. The fourth grouping involved proteins 
that function in cellular transport, and included importin 
subunits, major vault protein and coatomer subunits. The 
remaining proteins were grouped as miscellaneous. 
Remarkably, 94 of the 107 Hl.O-binding proteins from 
Table 1 have been observed in proteomics analyses of 
the human nucleolus (20,21), suggesting a role for H1.0 
in ribosome biogenesis/function. 

We speculated that the disordered H1.0 CTD may 
mediate many of the observed protein-protein inter- 
actions. To address this question, we determined which 
proteins were pulled down by the H1ACTD construct. 
The interactions that were observed in at least one of 
the two H1ACTD replicates were classified as 
CTD-independent, whereas the interactions that were 
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Table 1. Candidate Hl.O-binding proteins grouped by common function 



Protein" Accession Identifier Cell type pi Nucleolar CTD- 

number dependent 



mRNA Splicing 

Small nuclear ribonucleoprotein Sm Dl 

Small nuclear ribonucleoprotein Sm D3 

Small nuclear ribonucleoprotein E 

Small nuclear ribonucleoprotein-associated protein N 

Splicing factor 3 A subunit 3 

Splicing factor 3B subunit 1 

Splicing factor 3B subunit 2 

Splicing factor 3B subunit 3 

Pre-mRNA branch site protein pl4 

Splicing factor U2AF 35kDa subunit 

Splicing factor U2AF 65 kDa subunit, isoform 2 

U2 small nuclear ribonucleoprotein A' 

U5 small nuclear ribonucleoprotein 116 kDa component 

U5 small nuclear ribonucleoprotein 200 kDa helicase 

Pre-mRNA-processing-splicing factor 8 

Serine/arginine-rich splicing factor 1 (SF2/ASF) 

Serine/arginine-rich splicing factor 2 

Poly(U)-binding-splicing factor PUF60, isoform 2 

Heterogeneous nuclear ribonucleoprotein A2/B1, isoform A2 

Heterogeneous nuclear ribomicleoproteins C1/C2, isoform CI 

Heterogeneous nuclear ribonucleoprotein D 

Heterogeneous nuclear ribonucleoprotein F 

Heterogeneous nuclear ribonucleoprotein G 

Heterogeneous nuclear ribonucleoprotein H 

Heterogeneous nuclear ribonucleoprotein M 

Heterogeneous nuclear ribonucleoprotein U 

Heterogeneous nuclear ribonucleoprotein U-like 1, isoform 2 

Nuclease-sensitive element-binding protein 1 

ATP-dependent RNA helicase DDX1 

Interleukin enhancer-binding factor 2 

Poly(rC)-binding protein 1 

Nucleoprotein TPR 

RNA binding protein 39, isoform 2 

rRNA biogenesis 
Nucleolin 

Nucleophosmin, isoform 2 
FACT complex subunit SPT16 
FACT complex subunit SSRP1 
Casein kinase II subunit a' 
Casein kinase II subunit oil 
Histone deacetylase 2 

Nucleolar and coiled-body phosphoprotein 1, isoform p 

ATP-dependent RNA helicase DDX5 

Nucleolar RNA helicase 2 

Lupus La 

Matrin 

Ribosome/translation 
40S ribosomal protein S2 
40S ribosomal protein S3a 
40S ribosomal protein S8 
40S ribosomal protein S9 
40S ribosomal protein S13 
40S ribosomal protein S15 
40S ribosomal protein S23 
40S ribosomal protein S24 
60S ribosomal protein P0 
60S ribosomal protein L3 
60S ribosomal protein L4 
60S ribosomal protein L5 
60S ribosomal protein L6 
60S ribosomal protein L7 
60S ribosomal protein L7a 
60S ribosomal protein L8 
60S ribosomal protein L10 
60S ribosomal protein L12 
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Table 1. Continued 
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''Proteins identified in at least two of the three control replicates in the RNAse analysis are shown in italics. The remainder of the proteins appeared 
in both replicates of at least one cell type in our initial analysis. 

b Proteins that were identified in the study conducted by either Andersen el al. (20) or Jarboui el al. (21). 



absent from both of the H1ACTD replicates were classi- 
fied as CTD-dependent. We found that 20 of the 73 
Hl.O-protein interactions identified in this experiment 
were dependent on the CTD (Table 1). These findings 
indicate that in addition to their well-established roles in 
DNA binding, the HI .0 CTD and NTD-GD both can serve 
as combinatorial protein-protein interaction modules. 

Many of the candidate HI .0-binding proteins detected in 
our studies interact with RNA or are components of 
ribonucleoprotein particles. Thus, it is possible that the 
Hl.O-protein interactions may be indirect and RNA- 
mediated. To address this question, Hl.O-binding 



proteins were identified in three separate pull-downs from 
control and RNAse-treated U20S nuclear extracts. Based 
on the criterion that two or more unique peptides were 
observed in at least two of the three repetitions, 
79 proteins were identified (Supplementary Table SI). Of 
these, 43 were observed in our initial analysis, and 36 
were new and added to Table 1. Fifty-eight of the 79 
proteins were detected in at least one of the three RNAse- 
treated samples, and they were scored as RNA- 
independent (Supplementary Table SI). U2AF65, PRP8, 
PUF60, SRSF1, nucleolin and heterogenous nuclear 
RNPs (hnRNPs) C1/C2, F and M all are RRM 
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Figure 1. RbAp48 interacts with the CTD of histone H1.0. Proteins 
were pulled down from isolated CEM nuclei and subjected to western 
blotting as described in 'Material and Methods' section. 

domain-containing RNA-binding proteins that were 
present in two or more of the RNAse-treated replicates. 
We conclude that there was no wholesale loss of H1.0- 
protein interactions because of RNAse treatment, and 
that H1.0 interacts directly with many of the RNA- 
binding proteins in the extracts. Twenty-one of the 
proteins detected in the control samples were absent from 
all three of the RNAse-treated extracts, suggesting that 
these interactions may be RNA-dependent. The 
RNAse-dependent interactions tended to involve known 
RNA-binding proteins and were distributed among all 
functional groupings. An interaction could be 
RNA-dependent if HI directly bound RNA that was com- 
plexed with other proteins, or if HI interacted with only 
one of the protein components of a multi-protein-RNA 
complex. Additional studies will be needed to determine 
the molecular basis of the RNAse-dependent interactions. 

Table 1 only includes the proteins that were identified in 
multiple replicates in our experiments. If we consider those 
cases in which two unique peptides were observed in only 
one replicate of one cell type, an additional 191 potential 
Hl-binding proteins were identified by LC-MS/MS 
(Supplementary Table S2). RbAp48 is a protein that was 
observed in one replicate and in only one cell type. 
Western blotting confirmed that RbAp48 was pulled 
down by H1.0, and that the interaction was abolished in 
the absence of the CTD (Figure 1). This result indicates 
that at least some of the proteins that were observed by 
mass spectrometry in only one replicate (Supplementary 
Table S2) also are legitimate Hl-binding partners. 

Validation of candidate Hl-protein interactions 

A definitive test of whether an interaction identified by 
pull-downs is direct is provided by biophysical character- 
ization in a purified system. We, therefore, examined 
whether purified recombinant H1.0 was able to bind to 
three candidate proteins from Table 1 using the recently 
developed high-throughput interactions by fluorescence 
intensity (HI-FI) system (17,22). For these fluorescence 
(de)quenching experiments, a fixed concentration of 
fluorescently labeled H1.0 was mixed with varying concen- 
trations of the recombinant protein in question. Changes 
in the fluorescence intensity as a function of candidate 
protein concentration were fit to a one-site model, 
allowing determination of the apparent dissociation 
constant (K D ) and Hill coefficient for the interaction. 

U2AF65 is a splicing factor pulled down by H1.0. 
U2AF65 is a subunit of the U2AF complex, which binds 
both proteins and pre-mRNA to facilitate splice site rec- 
ognition and the early stages of spliceosome assembly (23). 
When the binding of U2AF65 to H1.0 was examined by 
the fluorescence assay, the normalized fluorescence 



N 



0 



- ASF/SF2 






-•- U2AF65 






» FACT 




^^^^ ^^^^^^^^ 








i l .... TZ. . . 







1 10 100 1 000 10000 

Protein concentration (nM) 



Figure 2. Interactions between histone H1.0 and U2AF65, SF2/ASF 
and FACT are direct. Shown are the normalized fluorescence changes 
on titration of recombinant U2AF65 (dark grey circle), SF2/ASF (grey 
triangle) and FACT (black square) into fluorescently labeled H1.0 as 
described in 'Materials and Methods' section. The error bars represent 
the standard error within individual data points. 

quenching as a function of U2AF65 concentration was 
well fit by a single-site model (Figure 2). The K D for the 
interaction was 0.82 uM, and the Hill coefficient was 1. 
These data indicate that H1.0 binding to U2AF65 was 
direct and of moderate affinity. Serine/arginine-rich 
splicing factor 1, more commonly known as splicing 
factor 2/alternative splicing factor (SF2/ASF), is another 
spliceosomal protein pulled down by H1.0. The fluores- 
cence quenching data for SF2/ASF (residues 11-196) 
indicate that it also is capable of binding directly to 
H1.0 (Figure 2). The K D for the interaction was 2.4 uM, 
and the Hill coefficient was 2. The FACT complex is a 
histone chaperone that consists of the Sptl6 and SSRP1 
subunits (24). We observed that H1.0 bound to recombin- 
ant FACT with high affinity (K D = 0.032 uM) and a Hill 
coefficient of 1 (Figure 2). Taken together, the quantita- 
tive biophysical data from Figure 2 validated the prote- 
omics analysis for the proteins examined. Although Sptl6 
has a highly acidic C-terminal domain implicated in core 
histone binding (24), SSRP1, U2AF65 and SF2/ASF all 
lack such negatively charged regions. Thus, the mechan- 
ism of H 1 binding to these proteins does not seem to be 
purely electrostatic. Likewise, we could not find any 
obvious common sequence motifs in the four proteins. 
Determination of the mechanism(s) responsible for 
HI -mediated protein-protein interactions will be a 
productive area of future research. 

DISCUSSION 

Linker histones are ubiquitous chromatin-associated 
DNA-binding proteins (1,3). Although it is widely held 
that linker histones function by stabilizing the condensed 
states of chromatin (1,3,25,26), there is some evidence that 
they also act in part through specific protein-protein inter- 
actions (4). To define the scope of protein-protein inter- 
actions in linker histone function, we performed a 
systematic proteomic analysis of proteins that were 
pulled down from human nuclear extracts by full-length 
H1.0 and H1ACTD proteins. Our analyses identified 107 
Hl.O-binding proteins (Table 1). The number of H1.0- 
binding proteins probably is greater, as at least some of 
the 191 proteins observed in only one replicate are 
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legitimate HI -binding partners (Figure 1). Most of the 
observed Hl.O-protein interactions are RNA-independent 
(Supplementary Table SI). Although some of the H1.0- 
binding proteins may have been indirectly pulled down as 
part of multi-protein complexes, we observed direct inter- 
actions between H1.0 and all three of the proteins 
examined (U2AF65, SF2/ASF and FACT) (Figure 2). 
The unexpectedly large number of Hl.O-binding proteins 
identified by our studies documents an important role for 
protein-protein interactions in linker histone action and 
suggests a new paradigm for HI structure and function 
that extends well beyond its effects on chromatin 
architecture. 



The nucleolus, the site of ribosome biogenesis (27,28), is 
composed of as many as 900 proteins in humans, 
including the linker histones (20,21). Strikingly, of the 
107 proteins pulled down by H1.0, 94 have been identified 
as components of the nucleolus (Table 1). Most of the 
proteins detected in one replicate also are nucleolar 
(Supplementary Table S2). This suggests that the nucle- 
olus may be a primary source of the HI -binding proteins 
in the nuclear extracts and raises the intriguing possibility 
that HI is a key regulator of nucleolar function. In 
support of this notion, H1.0 pulled down numerous 
proteins involved in mRNA splicing, rRNA synthesis 
and processing and ribosome function. Interestingly, 
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Figure 3. STRING analysis of the H1.0 interactome. The identifiers for the 107 proteins from Table 1 were entered into the STRING database 
(http://string.embl.de) (33). The confidence level was set to 0.4 (medium). Shown is the evidence view. Line colors represent the different types of 
evidence for the indicated association. 
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H1.0 previously has been shown to surround the nucleolus 
(29), whereas phosphorylated HI. 2 and HI. 4 localize to 
the nucleolus in vivo and are associated with increased 
RNA Pol I activity and rRNA biosynthesis (30). The in- 
volvement of rRNA synthesis/processing and ribosome 
biogenesis in nucleolar function is well established 
(27,28). The link to mRNA splicing may come from 
small nucleolar RNAs (snoRNAs), which are spliced out 
of the introns of host genes (31) and assembled into 
snoRNPs that modify and process rRNA (32). To deter- 
mine whether the Hl.O-binding proteins are functionally 
interrelated, the proteins from Table 1 were analyzed by 
the STRING engine (33). Results indicate that nearly all 
of the proteins pulled down by H1.0 are part of a 
well-defined interaction network (Figure 3). The analysis 
clearly identifies the spliceosome and the ribosome, reflect- 
ing the large number of splicing factors and ribosomal 
proteins pulled down by H1.0. Connecting the spliceo- 
some and ribosome clusters are numerous proteins that 
function in rRNA synthesis and processing. Several 
proteins in particular stand out as central 'hubs', including 
nucleolin, nucleophosmin (B38), FACT, casein kinase II, 
La (SSB) and the RNA helicase DDX1 (Figure 3). Given 
that HI interacts with so many multifunctional nucleolar 
proteins, the potential for regulation is enormous, as the 
interplay between protein concentrations and equilibrium 
constants will dictate which Hl-protein interactions 
dominate under any given set of conditions in vivo. We 
note that FRAP studies have shown that there are 
multiple kinetic classes of HI in the nucleus (34,35). In 
view of our findings, it seems reasonable to propose that 
the nucleolus may be the source of the slower exchanging 
fraction of HI. 

Examination of the specific spliceosomal proteins pulled 
down by H 1.0 provides insight into how HI may regulate 
spliceosome function. Recognition of 5' and 3' splice sites 
in the pre-mRNA is a key step in the splicing process. 
Splice site recognition involves recruitment of U2AF35 
and U2AF65 to the 3' splice site and the Ul snRNP 
particle to the 5' splice site by the serine/arginine-rich 
splicing factors (SR proteins) (23). The SR proteins 
regulate both constitutive and alternative splicing (36). 
H1.0 pulled down U2AF35, U2AF65 and two SR 
proteins (Table 1). In addition, two Ul snRNP proteins 
and six other SRs were observed in one replicate. Direct 
interactions between H1.0 and both U2AF65 and SF2/ 
ASF were rigorously confirmed by biophysical studies of 
pure recombinant proteins (Figure 2). The hnRNP 
proteins repress splice site recognition by competing for 
the same pre-RNA-binding sites as the SR proteins (23). 
H1.0 pulled down nine hnRNPs, and five more hnRNPs 
were observed in one replicate. Our results suggest that 
linker histones may regulate mRNA splice site recognition 
via interactions with SRs, U2AF and hnRNPs. H1.0 also 
pulled down two U2 snRNP components and four 
subunits of the U2-associated complex, SF3 (Table 1). 
This suggests that H1.0 also may be involved in U2 
snRNP function. 

In addition to the functional ramifications discussed 
earlier in the text, our studies have provided structural 
insight into HI -mediated protein-protein interactions. 



About 25% of the candidate Hl-protein interactions 
were found to be dependent on the HI CTD and 75% 
on the NTD-GD fragment (Table 1). The CTD is an 
intrinsically disordered domain whose function has been 
linked to its unique amino acid composition (the isoform 
CTDs consists of 38-42% lysine, 12-14% proline, 
18-34% alanine and essentially no aromatic residues) 
(8,37). Despite having low-sequence complexity, intrinsic- 
ally disordered protein domains often act as combinatorial 
protein-protein interaction modules, that is, the same 
domain is able to specifically interact with many different 
proteins (38). This has been termed 'fuzziness' (39). Our 
findings are entirely consistent with this concept. There is 
biochemical evidence for CTD-dependent protein-protein 
interactions as well (40,41). Perhaps more unexpected was 
the large number of interactions that were mediated by the 
NTD-GD fragment. The GD has a winged helix fold (5). 
Although most winged helix motifs are DNA-binding 
domains, several are known to mediate protein-protein 
interactions (42). The linker histone GD seems to fall 
into a unique class that can mediate both protein- 
protein and protein-DNA interactions. Finally, it is 
possible that the disordered NTD mediates some, if not 
many, of the Hl.O-protein interactions identified in the 
H1ACTD samples. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Tables 1-3. 
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